Locally differentially private high-dimensional data synthesis
نویسندگان
چکیده
In local differential privacy (LDP), a challenging problem is the ability to generate high-dimensional data while efficiently capturing correlation between attributes in dataset. Existing solutions for low-dimensional synthesis, which partition budget among all attributes, cease be effective scenarios due large-scale noise and communication cost caused by high dimension. fact, characteristics not only bring challenges but also make it possible apply some technologies break this bottleneck. This paper presents SamPrivSyn synthesis under LDP, composed of marginal sampling module generation module. The used sample from original obtain two-way marginals. process based on mutual information, updated iteratively retain, as much possible, attributes. reconstruct synthetic dataset sampled Furthermore, study conducted comparison experiments real-world datasets demonstrate effectiveness efficiency proposed method, with results proving that can protect retain information
منابع مشابه
Optimizing Locally Differentially Private Protocols
Protocols satisfying Local Differential Privacy (LDP) enable parties to collect aggregate information about a population while protecting each user’s privacy, without relying on a trusted third party. LDP protocols (such as Google’s RAPPOR) have been deployed in real-world scenarios. In these protocols, a user encodes his private information and perturbs the encoded value locally before sending...
متن کاملLocally Differentially Private Heavy Hitter Identification
The notion of Local Differential Privacy (LDP) enables users to answer sensitive questions while preserving their privacy. The basic LDP frequent oracle protocol enables the aggregator to estimate the frequency of any value. But when the domain of input values is large, finding the most frequent values, also known as the heavy hitters, by estimating the frequencies of all possible values, is co...
متن کاملLocally Differentially Private Protocols for Frequency Estimation
Protocols satisfying Local Differential Privacy (LDP) enable parties to collect aggregate information about a population while protecting each user’s privacy, without relying on a trusted third party. LDP protocols (such as Google’s RAPPOR) have been deployed in real-world scenarios. In these protocols, a user encodes his private information and perturbs the encoded value locally before sending...
متن کاملDifferentially Private Synthesization of Multi-Dimensional Data using Copula Functions
Differential privacy has recently emerged in private statistical data release as one of the strongest privacy guarantees. Most of the existing techniques that generate differentially private histograms or synthetic data only work well for single dimensional or low-dimensional histograms. They become problematic for high dimensional and large domain data due to increased perturbation error and c...
متن کاملAdaptive Differentially Private Histogram of Low-Dimensional Data
We want to publish low-dimensional points, for example 2D spatial points, in a differentially private manner. Most existing mechanisms publish noisy frequency counts of points in a fixed predefined partition. Arguably, histograms with adaptive partition, for example Voptimal and equi-depth histograms, which have smaller bin-widths in denser regions, would provide more statistical information. H...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Science China Information Sciences
سال: 2022
ISSN: ['1869-1919', '1674-733X']
DOI: https://doi.org/10.1007/s11432-022-3583-x